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We introduce a technique for establishing and amplifying gaps between parameters of network 
coding and index coding problems. The technique uses linear programs to establish separations 
■ between combinatorial and coding-theoretic parameters and applies hypergraph lexicographic 

products to amplify these separations. This entails combining the dual solutions of the lexico- 
graphic multiplicands and proving that this is a valid dual solution of the product. Our result is 
general enough to apply to a large family of linear programs. This blend of linear programs and 
lexicographic products gives a recipe for constructing hard instances in which the gap between 
combinatorial or coding-theoretic parameters is polynomially large. We find polynomial gaps in 
c/5 | cases in which the largest previously known gaps were only small constant factors or entirely 

O . unknown. Most notably, we show a polynomial separation between linear and non-linear net- 

work coding rates. This involves exploiting a connection between matroids and index coding 
to establish a previously unknown separation between linear and non-linear index coding rates. 
We also construct index coding problems with a polynomial gap between the broadcast rate and 
the trivial lower bound for which no gap was previously known. 
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The problem of Network Coding, introduced by Ahlswede et al [3] in 2000, asks for the maximum 
rate at which information can be passed from a set of sources to a set of targets in a capacitated 
network. In practice, there are many examples where network coding provides faster transmission 
rates compared to traditional routing, e.g. [18] details a recent one in wireless networks. However, 



despite tremendous initial success in using network coding to solve some broadcast problems (those 



in which every receiver demands the same message), very little is known about how to compute or 
approximate the network coding rate in general. (See [29] for a survey of the topic.) 

In the absence of general algorithms for solving network coding, attention has naturally turned 
to restricted models of coding (e.g. linear functions between vector spaces over finite fields) and to 
approximating network coding rates using graph-theoretic parameters (e.g. minimum cut [2] and 
the independence number [4]). Several of these variants provide bounds on the network coding 
rate, but the worst-case approximation factor of these bounds remains unknown. For example, it 
is known that there exists a network in which non-linear network coding can achieve a rate which 
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exceeds the best linear network code by a factor of but it is not known whether this gapo 

can be improved to n 1_e , or even possibly to 0(n). 

In this paper we introduce a general technique for amplifying many of these gaps by combining 
linear programming with hypergraph product operations. For instance, this enables us to construct 
a family of network coding instances with n messages, in which the rate of the best non-linear 
network code exceeds the rate of the best (vector-)linear network code by a factor of at least n e . 
A crucial ingredient in our technique is index coding 0[8], a class of communication problems in 
which a server holds a set of messages that it wishes to broadcast over a noiseless channel to a set 
of receivers. Each receiver is interested in one of the messages and has side-information comprising 
some subset of the other messages. The objective is to devise an optimal encoding scheme (one 
minimizing the broadcast length) that allows all the receivers to retrieve their required information. 
Following [4j, we use f3 to denote the limiting value of the information rate (i.e., ratio of broadcast 
length to message length) of this optimal scheme, as the message length tends to infinity. 

In our framework, index coding is most useful for isolating a sub-class of network coding prob- 
lems that can be combined using lexicographic products. However, it is also an important and 
well-studied problem in its own right. Index coding is intimately related to network coding in 
general. It is essentially equivalent to the special case of network coding in which only one edge 



index coding, thus implying that index coding captures much of the difficulty of network coding. 

Index coding is also intricately related to other well-studied areas of mathematics. Connections 
between matroids and index coding were established in [24J; for example, that paper shows that 
realizability of a matroid over a field ¥ is equivalent to linear solvability of a corresponding index 
coding problem. Index coding is also closely connected to graph theory: a special case of index 
coding can be described by an undirected graph G, representing a communication problem where a 
broadcast channel communicates messages to a set of vertices, each of whom has side-information 
consisting of the neighbors' messages. Letting a(G),x(G) denote the independence and clique-cover 
numbers of G, respectively, one has 



The first inequality above is due to an independent set being identified with a set of receivers with 
no mutual information, whereas the last one due to [7, 8j is obtained by broadcasting the bitwise 
XOR of the vertices per clique in the optimal clique-cover of G. As one consequence of the general 
technique we develop here, we settle an open question of [1] by proving that a(G) can differ from 
/3(G); indeed, we show that their ratio can be as large as n 0,139 . 

1.1 Contributions 

We present a general technique that amplifies lower bounds for index coding problems using 
lexicographic hypergraph products in conjunction with linear programs that express information- 

lr The literature on network coding distinguishes between linear network codes, in which the messages are required 
to be elements of a finite field, and vector-linear network codes, in which the messages are elements of a finite- 
dimensional vector space over a finite field. Linear coding is weaker, and a gap of size n 1_e is known [22]. Vector-linear 
coding is much more powerful, and no gap larger than lf/fO was known prior to our work. 

2 The unique finite-capacity edge represents the broadcast channel. Each sender is connected to the tail of this 
edge, each receiver is connected to its head, and each receiver has incoming edges directly from a subset of the senders, 
representing the side-information. 




Additionally, [25] shows that linear network coding can be reduced to linear 



a{G) < /3(G) < x(G) . 
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theoretic inequalities. The use of such linear programs to prove lower bounds in network coding 
theory is not new, but, perhaps surprisingly, they have not gained widespread use in the analysis of 
index coding problems. We give an information-theoretic linear program, whose solution, b, gives 
the best known lower bound on f3. However, our main innovation is the insight that this linear 
programming technique can be combined with the combinatorial technique of graph products to 
yield lower bounds for sequences of index coding and network coding problems. Specifically, we 
provide a lexicographic product operation on index coding problems along with an operation that 
combines dual solutions of the corresponding two linear programs. We show that the combined 
dual yields a dual solution of the linear program corresponding to the lexicographic product. Using 
this operation, we demonstrate that index coding lower bounds proven using linear programming 
behave supermultiplicatively under lexicographic products. This technical tool enables us to prove 
some new separation results answering open questions in the field. 

Our technique not only applies to the standard linear programs used in network information 
theory (those that express entropy inequalities such as submodularity) but to any family of linear 
programs constructed using what we call a tight homomorphic constraint schema. In particular, if 
one can develop a tight homomorphic constraint schema that applies to a restricted class of codes 
(e.g. linear) then it becomes possible to prove lower bounds for this class of codes and amplify them 
using lexicographic products. We pursue this approach in establishing a large multiplicative gap 
between linear and non-linear network coding. 

Theorem 1.1. Lower bounds for index coding problems can be proven by solving a linear program 
whose constraints are valid for the class of coding functions being considered. If the linear program 
is constructed using a tight homomorphic constraint schema (see Section^, then its optimum is 
supermultiplicative under the lexicographic product of two index coding problems. 

To separate linear from non-linear coding, we first produce a pair of linear inequalities that are 
valid information inequalities for tuples of random variables defined by linear functions over fields 
of odd (resp., even) characteristic, but not vice-versa. We obtain these inequalities by considering 
the Fano and non-Fano matroids; the former is a matroid that is only realizable in characteristic 2, 
while the latter is only realizable in odd characteristic and in characteristic 0. For each of the two 
matroids, we are able to transform a proof of its non-realizability into a much stronger quantitative 
statement about dimensions of vector spaces over a finite field. This, in turn, we transform into a 
tight homomorphic constraint schema of valid information inequalities for linear random variables. 

We then use the connection between matroids and index coding [lll [T2l[24] and these inequalities 
to give a pair of index coding instances where the best non-linear coding rate is strictly better than 
the best linear rate over a field of odd (resp., even) characteristic. We do this by establishing a 
general theorem that says that for a matroid M, and an inequality that is violated for the rank 
function of M, there is an index coding problem for which the bound obtained by adding this 
inequality to the LP is strictly greater than b. 

We can now plug the constraint schema into our lexicographic product technique and apply it 
to these two index coding problems to yield the aforementioned separation between (vector-)linear 
and non-linear network coding. 

Theorem 1.2. There exists an explicit family of network coding instances (based on index coding 
instances) with n messages and some fixed e > such that the non-linear rate is £l(n £ ) times larger 
than the linear rate. 
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The largest previously known gap between the non-linear and linear rates for network coding 
was a factor of ^ (|12j). No separation was known between these parameters for index coding 
(see [4l|22] for related separation results focusing on the weaker setting of scalar linear codes). 

As explained above, given any index coding problem G we can write down an LP whose con- 
straints are based on information inequalities that gives a lower bound on (3. It is the best known 
lower bound, and in many cases, strictly better than any previously known bound. Notably, we 
can show that the broadcast rate of the 5-cycle is at least |, giving the first known gap between 
the independence number a (which equals 2 for the 5-cycle) and the broadcast rate f3. Amplifying 
this gap using lexicographic products, we can boost the ratio /3/a to grow polynomially with n in 
a family of n-vertex graphs. 

Theorem 1.3. There exists an explicit family of index coding instances with n messages such that 
(3(G) is at least Q(n s ) times larger than a{G), where 5=1 — 21og 5 (2) ~ 0.139. 

The remainder of the paper is organized as follows. In Section [2] we give a formal definition 
of index coding and the lexicographic product of two index coding problems. In Section [3] we 
describe a general class of LPs and prove they behave supermultiplicatively under lexicographic 
products. Section U] is devoted to the proof of Theorem II .31 In Section [5] we give a construction from 
matroids to index coding and prove a number of connections between properties of the matroid 
and the parameters of the corresponding index coding problem. Finally, in Section [6] we establish 
inequalities that are valid for linear codes over fields of odd (resp., even) characteristic and then 
use these to prove Theorem 11.21 

2 Definitions 

An index coding problem is specified by a directed hypergraph G = (V, E), where elements of V 
are thought of as messages, and E C V x 2 V is a set of directed hyperedges (v, S), each of which 
is interpreted as a receiver who already knows the messages in set S and wants to receive message 
v . Messages are drawn from a finite alphabet S, and a solution of the problem specifies a finite 
alphabet Xp to be used by the public channel, together with an encoding scheme £ : E^l — > £p 
such that, for any possible values of {x v ) v& y, every receiver (v,S) is able to decode the message 
x v from the value of £(x) together with that receiver's side information. The minimum encoding 
length i = |~log 2 |Sp|] for messages that are t bits long (i.e. S = {0, 1}*) is denoted by (3t(G). As 
noted in [22], due to the overhead associated with relaying the side- information map to the server 
the main focus is on the case t > 1 and namely on the following broadcast rate. 

(3(G) = lim = inf (2.1) 

t— >oo t t t 

(The limit exists by subadditivity.) This is interpreted as the average asymptotic number of broad- 
cast bits needed per bit of input, that is, the asymptotic broadcast rate for long messages. We 
are also interested in the optimal rate when we require that £ is a finite-dimensional vector space 
over a finite field F, and the encoding function is linear. We denote this by A F , and we denote the 
optimal linear rate over any field as A. 

A useful notion in index coding is the following closure operation with respect to G, a given 
instance of the problem: for a set of messages Scy, define 

d(S) = cl G (S) = Su{x\ 3(x, T)eE s.t. T <Z S} . (2.2) 
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The interpretation is that every message x £ cl(S) can be decoded by someone who knows all of the 
messages in S in addition to the broadcast message. In Section [5] when we discuss a transformation 
that associates an index coding problem to every matroid, the closure operation defined in this 
paragraph — when specialized to the index coding problems resulting from that transformation — 
will coincide with the usual matroid-theoretic closure operation. 

We next define the lexicographic product operation for directed hypergraphs, then proceed to 
present Theorem 12 . 2 1 which demonstrates its merit in the context of index coding by showing that (3 
is submultiplicative for this operation. The proof gives further intuition for the product operation. 

Definition 2.1. The lexicographic product of two directed hypergraphs G, F, denoted by G • F, 
is a directed hypergraph whose vertex set is the cartesian product V(G) x V(F). The edge set of 
G • F contains a directed hyperedge e for every pair of hyperedges (ec,ef) £ E(G) x E(F). If 
ec = (wg,Sg) and = (wf,Sf), then the head of e = {e.Q, ep) is the ordered pair (wg,wp) and 
its tail is the set (Sg x V(F)) U ({u>g} x Sf). Denote by G' n the n-fold lexicographic power of G. 

Remark. In the special case where the index coding problem is defined by a graph! the above 
definition coincides with the usual lexicographic graph product (where G • F has the vertex set 
V(G)xV(F) and an edge from (it, v) to (it', v') iff either (u, it') £ E(G) or it = u' and (v, v') £ E(F)). 

Theorem 2.2. The broadcast rate is submultiplicative under the lexicographic product of index 
coding problems. That is, (3(G • F) < /3(G) (3(F) for any two directed hypergraphs G and F. 

Proof. Let e > and, recalling the definition of (3 in (|2.ip as the limit of /3t/t, let K be a sufficiently 
large integer such that for all t > K we have (3 t (G)/t < /3(G) + e as well as f3 t (F)/t < (3(F) + e. 
Let E = {0, 1}^" and consider the following scheme for the index coding problem on G • F with 
input alphabet £, which will consist of an inner and an outer code. 

Let Ep denote an encoding function for F with input alphabet £ achieving an optimal rate, i.e. 
minimizing log(|£p|)/log(|£|). For each v £ V(G), the inner code applies £p to the |V(F)|-tuple of 
messages indexed by the set {v} x V(F), obtaining a message m v . Note that our assumption on |E| 
implies that the length of m v is equal to K' for some integer K' such that K < K' < (/3(F) +e)K. 
Next, let Eg denote an optimal encoding function for G with input {0, 1}^ . The outer code applies 
Eg to {mv} V £V(G) an d the assumption on K ensures its output is at most ((3(G) + e)K' bits long. 

To verify that the scheme is a valid index code, consider a receiver in G • F represented by 
e = ((wg,wf), (Sg x V(F)) U ({vjg} x Sp)). To decode (wg,wp), the receiver first computes m v 
for all v £ Sg- Since Eg is valid for G, receiver e can compute m WG , and since Ep is valid for F, 
this receiver can use the messages indexed by {wq} x Sp along with m WG to compute (wg,wf). 

Altogether, we have an encoding of K bits using at most ((3(F) + e)((3(G) + e)K bits of the 
public channel, and the required result follows from letting e —¥ 0. ■ 

3 Linear programming 

In this section we derive a linear program whose value constitutes a lower bound on the broadcast 
rate, and we prove that the value of the LP behaves supermultiplicatively under lexicographic 
products. In fact, rather than working with a specific linear program, we work with a general class 

3 When there are n messages and exactly n receivers, w.l.o.g. receiver i wants the message Xi and one can encode 
the side-information by a graph on n vertices which contains the edge (i,j) iff receiver i knows the message Xj. 



5 



min 


z$ 






max |/| • w - J2sgt c stXst 




D. L. 


z i 


- 1 71 
— 


\ w ) 


S - 1 - 2^iq a qSyq t z^tdS x st 




VScT 


z T 




(x) 


J2q a q$Vq + Et/B X 0T = 


1 




Az 


> o 


(y) 




w = 










x,y > 





Figure 1: The LP and its dual. 



of LP's having two types of constraints: those dictated by the network structure (which are the 
same for all LP's in the general class), and additional constraints depending only on the vertex set, 
generated by a constraint schema, i.e. a procedure for enumerating a finite set of constraints given 
an arbitrary finite index set. We identify some axioms on the constraint schema that constitute a 
sufficient condition for the LP value to be supermultiplicative. An example of a constraint schema 
which is important in network information theory is submodularity. For a given index set I, the 
submodularity schema enumerates all of the constraints of the form z$ + Zt > -zsnT + z sut where 
S, T range over subsets of /. 

Now we explain the general class of LPs which behave submultiplicatively under the lexico- 
graphic product and give bounds on /3. Given an index code, if we sample each message indepen- 
dently and uniformly at random, we obtain a finite probability space on which the messages and 
the public channel are random variables. If S is a subset of these random variables, we will denote 
the Shannon entropy of the joint distribution of the variables in S by H(S). If S C T C cl(S') 
then every message in T \ S can be decoded given the messages in S and the public channel p, and 
consequently H(S U {p}) = H(T U {p}). More generally, if we normalize entropy (i.e. choose the 
base of the logarithm) so that H(x) = 1 for each message x, then for every S C T we have 

H(T U M) - H(S U {p}) < \T \ cl(S)\ = c ST , (3.1) 

where the above is taken as the definition of cst- This implies that for any index code we obtain 
a feasible solution of the primal LP in Figure Q] by setting zs = H(S U {p}) for every S. Indeed, 
the first constraint expresses the fact that the value of p is determined by the values of the n 
messages, which are mutually independent. The second constraint was discussed above. The final 
line of the LP represents a set of constraints, corresponding to the rows of the matrix A = (a q s), 
that are universally valid for any tuple of random variables indexed by the message set /. For 
instance, it is well known that the entropy of random variables has the submodularity property: 
H(S) + H(T) > H(S U T) + H(S n T) if S, T are any two sets of random variables on the same 
sample space. So, for example, the rows of the constraint matrix A could be indexed by pairs of 
sets S, T, with entries in the (5, T) row chosen so that it represents the submodularity constraint 
(namely a q s = a g x = 1, a q sc\T = flqSur = — 1 and all other entries of row a of A are zero). Noting 
that H{{p}) < j5{G) we can altogether conclude the following theorem. 

Theorem 3.1. For an index coding problem G, let 25(G) be the LP in Figured when A represents 
the submodularity constraints and let b(G) be its optimal solution. Then b(G) < (3(G). 
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It is known that entropies of sets of random variables satisfy additional linear inequalities 
besides submodularity; if desired, the procedure for constructing the matrix A could be modified to 
incorporate some of these inequalities. Alternatively, in the context of restricted classes of encoding 
and decoding functions (e.g. linear functions) there may be additional inequalities that are specific 
to that class of functions, in which case the constraint matrix A may incorporate these inequalities 
and we obtain a linear program that is valid for this restricted model of index coding but not valid 
in general. We will utilize such constraints in Section [6] when proving a separation between linear 
and non-linear network coding. 

Definition 3.2. A constraint schema associates to each finite index set I a finite set Q(I) (indexing 
constraints) and a matrix A(I) with rows indexed by Q(I) and columns indexed by V(I), the power 
set of /. In addition, to each Boolean lattice homomorphismEl h : V{T) — > V{J) it associates a 
function h* : Q(I) ->■ Q(J). 

Let 1 be the "P(I)-indexed vector such that I5 = 1 for all S, and let lj be the vector where 
(lj)s = 1 for all S containing i and otherwise (lj)s = 0. We say that a constraint schema is tight 
if A(I)1 = A(I)li = for every index set / and element i £ I. 

Given h and h* let Ph and Qh be matrices representing the linear transformations they induce 
on M.^^ — >• M.^^ and JR®W — > M.^ J \ respectively. That is, Ph and Qh have zeros everywhere 
except (Ph)h(S)S = 1 an d {Qh)h,(q)q = 1- We say that a constraint schema is homomorphic if it 
satisfies A{ J) T Q h = P h A(I) T for every Boolean lattice homomorphism h : V(I) — > V(J). 

Example 3.3. Earlier we alluded to the submodularity constraint schema. This is the constraint 
schema that associates to each index set / the constraint-index set Q(I) = V(I) x V(I), along 
with the constraint matrix A(I) whose entries are as follows. In row (S, T) and column U, we have 
an entry of 1 if U = S or U = T, an entry of— lifC/ = 5nTorC/ = «S'UT, and otherwise 0. 
(If any two of S, T, S n T, S U T are equal, then that row of A(I) is set to zero.) It is easy to 
verify that A(I)1 = A(J)lj = for all i 6 J, thus the schema is tight. For a homomorphism h, 
the corresponding mapping of constraint sets is h*(S,T) = (h(S),h(T)). We claim that, equipped 
with this mapping of h — > h*, the constraint schema is homomorphic. Indeed, to verify that 
A( J) T Qh = PhA(I) T take any two sets S,T C I and argue as follows to show that u = Ph A{I) T es,T 
and v = A{ J) T Qh es,T are identical (here and henceforth ex,Y denotes the standard basis vector 
of having 1 in coordinate (X,Y) for X, Y C I). First observe that A(I) T es,T is the vector 

u € R^u-O which has entries everywhere except us = ut = 1 an d usut = usnT = — 1 provided 
that S <^T <£. S, otherwise u = 0. As such, u = PhU has entries everywhere except 

u h(S) = u h(T) = 1 i U h (SUT) = Uh(SllT) = ~ 1 

provided that S ^ T <t S and furthermore h(S) ^ h(T) ^ h(S), otherwise u = (for instance, 
if S C T then u = and so u = 0, whereas if h(S) C h(T) then u belongs to the kernel of Ph)- 
Similarly, Qhes,T = ^h(S),h( T ) anc ^ therefore v = A(J) T eh(s),h(T) has entries everywhere except 

v h(s) = v h(T) = 1 , v h (s)uh(T) = v h (s)nh(T) = -1 

provided that h(S) <£. h(T) ^ h(S), otherwise v = 0. To see that u = v note that if h(S) C h(T) 
then u = v = 0, and if 5 C T then again we get h(S) C h(T) due to monotonicity (recall that h is a 

4 A Boolean lattice homomorphism preserves unions and intersections, but does not necessarily map the empty set 
to the empty set nor the universal set to the universal set, and does not necessarily preserve complements. 
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lattice homomorphism) and so u = v = 0. Adding the analogous statements obtained from reversing 
the roles of S, T, it remains only to verify that u = v in case h(S) h(T) h(S), which reduces by 
the above definitions of u and v to requiring that h(SUT) = h(S)Uh(T) and h(SnT) = h(S)nh(T). 
Both requirements are satisfied by definition of a Boolean lattice homomorphism, and altogether 
we conclude that the submodularity constraint schema is homomorphic. 

Theorem 3.4. Let A be a tight homomorphic constraint schema. For every index coding problem 
let p(G) denote the optimum of the LP in Figure[l\when L = V(G) and the constants cst are defined 
as in (|3.ip . Then for every two index coding problems G and F , we have p{G • H) > p(G) p(F) . 

Proof. It will be useful to rewrite the constraint set of the dual LP in a more succinct form. First, 
if x is any vector indexed by pairs S, T such that 5 C T C J, let Vx £ IR^W denote the vector such 
that for all S, (Vi)s = ^2tdS x st — YItcS x ts- Next, for a set S C I, let es denote the standard 
basis vector vector in M^^) whose S component is 1. Then the entire constraint set of the dual LP 
can be abbreviated to the following: 

A T y + Vx + wei = e , x,y>0. (3.2) 

Some further simplifications of the dual can be obtained using the fact that the constraint schema 
is tight. For example, multiplying the left and right sides of (j3.2j) by the row vector 1 T gives 

l T A T y + l T Vx + w = 1. 

By the tightness of the constraint schema 1 T A T = 0. It is straightforward to verify that l T Vx = 
and after eliminating these two terms from the equation above, we find simply that w = 1. Similarly, 
if we multiply the left and right sides of (13. 2j) by the row vector lj and substitute w = 1, we obtain 
lj A T y + l^Vx + 1 = and consequently (again by the tightness) we arrive at 1 = -lJVx. At 
the same time, — lJVx = ^ sct %st by definition of Vx, hence summing over all i € I yields 

i£T\S 

\L\ = \T\S\xst- 

SCT 

Plugging in this expression for |/| and w = 1, the LP objective of the dual can be rewritten as 
l J l ~ ^2 °stxst = ^2(\T\S\- c S t) x S t = X, l Tn ( cl ( 5 ') \ s )\ x st, 

SCT SCT SCT 

where the last equation used the fact that cst = \T\ cl(5)|. We now define 

d(S,T) = \T n {cl(S) \ S)\ 
and altogether we arrive at the following reformulation of the dual LP. 

max X/ScT d(S,T) xst 

s.t. A T y + Vx = e0 — e/ (3.3) 
x, y > . 

Now suppose that (£ G , r] G ), (£ F , n F ) are optimal solutions of the dual LP for G, F, achieving 
objective values p(G) and p(F), respectively. (Here £,77 play the role of x,y from (13. 3p . resp.) 
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We will show how to construct a pair of vectors (£ G ' F , rj G ' F ) that is feasible for the dual LP of 
G • F and achieves an objective value of at least p(G) p(F). The construction is as follows. Let 
g : P(V(G)) V(V{G • F)) be the mapping g(X) = X x V(F). For sets S C T C V(G), let 
/j5T . 7?(V(F)) -> 7>(F(G • F)) be the mapping /i ST (X) = (T x X) U (S x ^(F)). Observe that 
both mappings are Boolean lattice homomorphisms. 

To gain intuition about the mappings g, h ST it is useful to think of obtaining the vertex set of 
G • F by replacing every vertex of G with a copy of F. Here g({v}) maps the vertex v in G to 
the copy of F that replaces v. The mapping /i^du}) maps a vertex u in F to the vertex u in the 
copies of F that replace vertices in T, and then adds the set {u} x V(F). 

Recall that Definition [372] associates two matrices P^, to every Boolean lattice homomorphism 
h : P(-f) — > V(J). It is also useful to define a matrix Rh as follows: the columns and rows of Rh are 
indexed by pairs S C T C 7 and X C K C J, respectively, with the entry in row XY and column 
ST being equal to 1 if X = h(S) and Y = h(T), otherwise 0. Under this definition, 

V(R h x) = P h Vx for any x € . (3.4) 

Indeed, if x = es,T for some S C T C 7 then Ves,T = &s — ct and so Ph^s,t = e h(s) ~~ e h(T)> 
whereas V(R h e s>T ) = V(e h ( 5)i/l(T) ) = e h(5) - e h(T) . 

We may now define 

£ G,F =E^ G )^^ F ), (3-5) 

= Q 9 v G + E ^) • ( 3 - 6 ) 

SCT 

In words, the dual solution for G • F contains a copy of the dual solution for F lifted according 
to h ST for every pair S C T and one copy of the dual solution of G lifted according to g. The 
feasibility of {£ G ° F , i] G ' F ) will follow from multiple applications of the homomorphic property of 
the constraint schema and the feasibility of {£, F ,rj F ) and (£ G ,r/ ), achieved by the following claim. 

Claim 3.5. The pair ,r] G ' F ) as defined in (|3.5p . ()3.6p is a feasible dual solution. 

Proof. The matrices Q g , F^st, Q^st all have {0, 1}- valued entries thus clearly £ F ,rj F > 0. 
Letting A = A(G«F), we must prove that A T rj + V£ G,F = e$ — ey( G .^). Plugging in the values 
of (C G ' F ,r] GmF ) we have 

.4 + ^ G ' F = A T Q gV G + £ (e G ) ST (A T Q h sTri F ) + E (e G W V(F, ST ^) , 

SCT SCT 

= P 9 A(G) T r, G + £ (£ G )st (f^(F) V + V(F, sr . (3.7) 

SCT 

where the second equality applied the homomorphic property of the constraint schema. To treat 
the summation in the last expression above, recall (I3.4p which implies that 

P h sTA(F) T V F + V(R h s T Z F ) = P h s T A(F) T V F + P h srVZ F = P h s T (e 9 - e v(F) ) , (3.8) 

with the last equality due to the fact that {^ F ,r] F ) achieves the optimum of the dual LP for F. 
Recalling that Ph&s = e h(S) f° r an y h and combining it with the facts h (0) = S x V(F) and 
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g (S) = S x V(F) gives P h s T e % = e SxV{F ) = P g e s . Similarly, since h ST (V(F)) = Tx V(F) we have 
P h STey(p-) = exxV(F) = Pg e T, and plugging these identities in (13.8|) combined with (|3.7p gives: 



A T r] G ' F + V£ 



G»F 



P. 



A(G) t v g +Y,(H G )st (es-er) 

SCT 



Collecting together all the terms involving eg for a given S 1 G V(I), we find that the coefficient of 
es is £tds(£ G W - £ rc5 (£ G )sT = (V£ G )s- Hence, 



A T rj G ' F + V£ 



G»F 



Pa 



A(G) T V G + V£ G j = P 9 [e - e v{G) ] = e - e v(G . F) 



where the second equality was due to (£ , r/ ) achieving the optimum of the dual LP for G. 



To finish the proof, we must evaluate the dual LP objective and show that it is at least p(G) p(F), 
as the next claim establishes: 

Claim 3.6. The LP objective for the dual solution given in Claim [375\ has value at least p(G) p(F). 

Proof. To simplify the notation, throughout this proof we will use K, L to denote subsets of V{G*F) 
while referring to subsets of V(G) as S,T and to subsets of V(F) as X, Y. We have 

£ d(K,L)(f F ) KL = d(K,L) Y,(t G )sT(R h sTe)KL 

KcL KcL SCT 

= £(£ G )st( £ d(K,L) (R hST £ F ) KL ) 

ScT ^ KCL ' 

= E«<V( £ d(h ST (X),h ST (Y)) ^)xy) , (3.9) 

SCT ^-XCY ' 

where the last identity is by definition of Rh- 

At this point we are interested in deriving a lower bound on d(h ST (X), h ST (Y)^, to which end 
we first need to analyze clc%F(h ST (X)). Recall that E(G • F) consists of all hyperedges of the 
form (w,K) with w = (wq,Wf) and K = (Wq x V(F)) U ({wq} x Wp) for some pair of edges 
(w G , Wg) e E(G) and (w F , W F ) G E{F). We first claim that for any S C T and X C V(F), 

cl G . F {h ST (X)) \ h ST (X) D ( (cl G (5) \ S) n T) X ( cl F (X) \ x) . (3.10) 

To show this, let L C V(G • -F) denote the set on the right side of (|3.10p . Note that L contains 
no ordered pairs whose first component is in S or whose second component is in X, and there- 
fore L is disjoint from h ST (X) = (T x X) U (S x V(F)). Consequently, it suffices to show that 
c\g»f {h ST (X)) D L. Consider any w = (wg,w f ) belonging to L. As u> G G cl G (iS) \ 5, there 
must exist an edge (wg,Wg) G F(G) such that W(j C S. Similarly, there must exist an edge 
(w F , Wp) G -E(-F) such that W F Q X. Recall from the definition of L that {w G } Q T. Now letting 
K = (W G x V(F)) U ({w G } x W», we find that if C (S x V(F)) U (T x I) = h ST (X) and that 
(w,K) G F(G • F), implying that w G cIg.f (h ST (X)) as desired. 

Let X = h ST (X) and Y = h ST (Y), and recall that d(X,Y) is defined as | (cW(X) \X) DY\. 
Using (f3TT0|) and noting that Y D (T x F) we find that 

(ci G . F (x) \i)ny d ( (d G (5) \s)nr)x( (ci F (x) \ x) n y) 
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and hence 

d(X,Y) > |(cl G (S)\5)nT| • \(cl F (X)\X)nY\ = d(S,T)d(X,Y). 
Plugging this bound into (|3.9p we find that 



KcL SCT XcY 



d(K,m G ' F )KL > ^2(ti G )sT Yl d(S,T)d(X,Y)(e)xY 

SCT XCY 

( Y, d(S,T)(f) ST ) ( £ d(X,Y)(e)x Y ) = p{G)p(F) 

^ SCT ' ^-XCY 



as required. 

Combining Claims 13.51 and 13.61 concludes the proof of the Theorem [37 



Remark. The two sides of ()3.10|) are in fact equal for any non-degenerate index coding instances 
G and F, namely under the assumption that every (wq, Wg) £ F(G) has u>g ^ Wq (otherwise this 
receiver already knows the required wg and may be disregarded) and Wq ^ (otherwise the public 
channel must include wq in plain form and we may disregard this message), and similarly for F. 
To see this, by definition of c\ G , F (-) and the fact that h ST {X) = (T x X) U (S x V(F)) it suffices 
to show that every edge (w,K) G E(G»F) with K C h ST (X) satisfies w G (cl G (S)nT) x cl F (X). 
Take {w,K) G E(G • F) and let (w G ,W G ) € E(G) and {w F ,W F ) G E(F) be the edges forming 
it as per Definition 12.11 of the lexicographic product. A prerequisite for K C h ST (X) is to have 
wq G T as otherwise {u>g} x Wf 2 h ST (X) (recall that S C T and that Ifp 7^ 0). Moreover, as 
X is strictly contained in V(F) we must have Wg Q S in order to allow Wg x ^(-f 1 ) != h (X), 
thus (using the fact that u>g ^ Wg and so ^ S) we further require that W F C X. Altogether 
we have Wg C 5, Wp C X and !«g G T, hence (wg,w f ) G (cIg(S') n T) x c1f(X) as required. 



4 Separation between a and /3 

To prove Theorem 11.31 we start by using Theorem 13.11 to show that P{C§) > ol{C§) where C5 
is the 5-cycle. Then we apply the power of Theorem 13.41 to transform this constant gap on C5 to a 
polynomial gap on C| . 

First we show that /3(Cs) > 6(65) > |. We can show that b{C§) > § by providing a feasible 
dual solution for the LP 53 with value |. This can easily be achieved by listing a set of primal 
constraints whose variables sum and cancel to show that z$ > |. Labeling the vertices of C5 by 
1,2,3,4,5 sequentially, such a set of constraints is given below. It is helpful to note that in an 
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index coding problem denned by an undirected graph, x € cl(5) if all the neighbors of x are in 5. 



2 > z {h3} - z % 

2 > ^{2,4} - z 9 

1 > Z{5} ~ Z® 

> ^{1,2,3} - Z {1,3} 

> ^{2,3,4} ~ ^{2,4} 
^{2,3,4} + Z {1,2, 3} > ^{2,3} + ^{1,2,3,4} 
^{2,3} + z {5} > z + z {2,3,5} 

— Z {1, 2,3,4,5} ~~ Z {1,2, 3,4} 

> £{1,2,3,4,5} - ^{2,3,5} 
£{1,2,3,4,5} = 5 
^{1,2,3,4,5} = 5 

Applying Theorem 13.41 we deduce that for any integer k > 1 the fc-th lexicographic power of C5 
satisfies /3(C|) > b(C k ) > (|) . Furthermore, a(Cs) = 2 and it is well known that the independence 
number is multiplicative on lexicographic products and so a(C k ) = 2 k . Altogether, Cf is a graph 
on n = 5 k vertices with a = n log s( 2 ) and (3 > n 1_log 5( 2 ), implying our result. 

5 Matroids and index coding 

Recall that a matroid is a pair M = (E,r) where E is a ground set and r : 2 E — > N is a rank 
function satisfying 

(i) r(A) < \A\ for all AC£; 

(ii) r(A) < r(B) for all A C 1? C £? (monotonicity) ; 

(iii) r(i)+r(B)>r(AUB) + r(Afl £?) for all A,B C E (submodularity). 

The rank vector of a matroid, r(M), is a 2 -dimensional vector indexed by subsets of E, such 
that its 5-th coordinate is r(5). A subset 5 C E is called independent if r(5) = |5| and it is called 
a frasis of M if r(5) = |5| = r{E). 

In this section we give a construction mapping a matroid to an instance of index coding that 
exactly captures the dependencies in the matroid. We proceed to show some useful connections 
between matroid properties and the broadcast rate of the corresponding index coding problem. 

Definition 5.1. Let M = (E,r) be a matroid. The hypergraph index coding problem associated 
to M, denoted by Gm, has a message set E and all receivers of the form 

{(x,S) I xeE, SCE, r(5) = r(5u{x})} . 

Remark. A similar yet slightly more complicated construction was given in |24j . Our construction 
is (essentially) a subset of the one appearing there. A construction that maps a matroid to a 
network coding problem is given in [111112]. They prove an analog of Proposition 15.21 

Proposition 5.2. For a matroid M = (E,r), 6(Gm) = \E\ — r(E). 

Proof. In what follows we will let n = \E\ and r = r(E). To show that o{Gm) < n — r it suffices 
to show zs = r(5) + n — r is a feasible primal solution to the LP *B(Gm)- The feasibility of 
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constraints (w) and (x) follows trivially from the the definition of Gm and properties of a matroid. 
The feasibility of (y) : zt — z$ < cst V5 C T follows from repeated application of submodularity: 

ZT - Zs = r (T) - r(S) < r{S U {x}) - r(S) 

x£T\S 

< (r(SU{x})-r(S)) + r({x})<\T\cl(S)\ = c S T. 

xecl(S) x£T\cl(S) 

To prove the reverse inequality, let S be any basis of M and note that = ze — (ze—zs) — (zs—zq) > 
n - c S e ~ C05 = n - r. ■ 

The following definition relaxes the notion of a representation for a matroid. 

Definition 5.3. A matroid M = (E,r) with \E\ = n is under-representable in d dimensions over 
a finite field F if there exists a d x n matrix with entries in F and columns indexed by elements of 
E such that (i) the rows are independent and (ii) if r(x U S) = r(S) then the columns indexed by 
x U S are dependent. 

Observe that if a matrix represents M then it also under-represents M. We next show a relation 
between under-representations for M over F and the scalar linear rate A^, where the alphabet vector 
space, over which the encoding functions are required to be linear, is single-dimensional. Note that 
A F < Af . The following is the analogue of Theorem 8 in |24| for our version of the matroid to index 
coding mapping. 

Theorem 5.4. A matroid M = (E,r) with \E\ = n is under-representable in d dimensions over 
a finite field F if and only if \\(Gm) < n — d. In particular, if M is representable over ¥ then 
\ ¥ (G M ) = f3(G M ) = n-r(E). 

Proof. Let R be a d x n matrix which under-represents M in d dimensions over F. Let Q be an 
(n — d) x n matrix whose rows span the kernel of R. We will show that Q is a valid encoding matrix 
for Gm- Let y £ ¥ E be some input message set and consider a receiver (x,S), who wishes to decode 
y x from {y z : z G S} and the broadcast message Qy. Extend ker(Q) arbitrarily into a basis B for 
¥ E and let y = y' + y" be the unique decomposition according to B such that y' E ker(Q). Clearly, 
Qy" = Qy since y' E ker(Q), hence one can recover y" from the public channel by triangulating Q. 
It remains for the receiver (x, S) to recover y' x . To this end, observe that the rows of R span ker(Q) 
and recall that by Definitions 15.11 and 15. 3| column x of R is a linear combination of the columns 
of R indexed by S. Since y' is in the row-space of R it follows that y' x is equal to the exact same 
linear combination of the components of y' indexed by S, all of which are known to the receiver. 
Altogether, the receiver can recover both y' x and y x and obtain the message x. As this holds for 
any receiver, we conclude that Q is a valid encoding matrix and thus A[(Gm) < n — d. When 
d = r(E) the inequality is tight because this upper bound coincides with the lower bound given by 
Proposition 15.21 

Conversely, suppose that there exists a scalar linear code for Gm over F with rate n — d, and 
let Q be a corresponding (n — d) x n encoding matrix of rank n — d. Let Rbearfxn matrix whose 
rows span the kernel of Q. We claim that R under-represents M. Indeed, consider a receiver (x,S). 
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It is easy to verify that this receiver has a linear decoding functiorjl of the form u T ■ Qy + v T ■ ys 
for some vectors u, v, where ys is the vector formed by restricting y to the indices of S. As Q is 
a valid encoding matrix for Gm, this evaluates to y x for any y £ ¥ E . In particular, if y T is a row 
of R then Qy = and so v T ■ ys = y x , and applying this argument to every row of R verifies that 
column x of R is a linear combination of the columns of R indexed by S (with coefficients from v). 
Since this holds for any receiver we have that R under-represents M, as required. ■ 

We conclude this section with a result that will be useful in establishing lower bounds on the 
value of the LP for Gm with a given constraint matrix A. 

Theorem 5.5. Suppose that M = (E,r) is a matroid and A is a matrix such that Al = and 
Ar(M) ^ 0. If the linear program in Figure{l\ is instantiated with constraint matrix A, then the 
value of the LP is strictly greater than \E\ — r(E). 

Proof. We will give a dual solution (w,x,y) to the LP with value strictly greater than \E\ — r(E). 

Recalling the hypothesis Ar(M) ^ 0, let q be a row of A such that YIsce a qSr{S) < 0. Let 
S+ = {S C E | a qS > 0, S / £,0} and S~ = {S C E \ a qS < 0, S + f,0}. Note that the 
hypothesis that A\ = implies that a q $ + J2seS+ a <iS = — { a qE + YlseS- a qs) • Assume that A 
is scaled so a q % + YlseS+ a qS = — { a gE + X^Ses- a qs) = 1- This assumption is without loss of 
generality since a q E + Yls&s~ a qS is strictly negative, as can be seen from the following calculation: 



r(E) 



(a qE + ^2 a qS j <a qE r(E)+ ^ a qS r(S) < a qE r(E) + ^ a qS r(S) + ^ a qS r(S) 



Define the dual vector y by setting y q = 1 and y q > = for rows q' ^ q of A. To define the dual 
vector x, let us first associate to every set 5 C E a matroid basis b(S) such that the set m(S) = 
b(S) n S is a maximal independent subset of S, i.e. 1^(5)1 = r(m(S)) = r(S). Let u(S) = SUb(S). 
For every S G S+, let x^ S ) = x m ( S )S = a q s and for every S G 5", let x Su ( S ) = x u (S)E = ~a q s- 
Set all other values of xst to zero. Finally, set w = 1. By construction, (w,x,y) satisfies all of the 
dual constraints. Using the relations c$ m (s) = f(S), cs u (s) = r(E) — r(S), c m (s)S = c u(s)E = 0, we 
find that the dual LP objective value is 

\ E \ w c stxst = \E\- ^2 ( c <Dm(s) + c m (s)s) a qs - X] ( c Su(s) + c u(s)e) {-a g s) 
sct ses+ ses- 

= \e\-J2 r ( s ) a «s + E ^ - r ^ a ^ 

ses+ ses- 

= \E\+ J2 a gSr(E) - J2 a qS r(S) + a g0 r(0) + a qE r{E) 
ses- s 

= \E\-r(E)-J2a q sr(S). 

s 

By hypothesis a q sr(S) < 0, and the proposition follows. ■ 



5 This follows e.g. from decomposing y as above into y' + y" where y' € ker(Q). By definition y" is a linear 
combination of the Qy entries. Similarly, y' x must be a linear combination of {y z : z £ S}, otherwise there would 
exist some y £ ker(Q) with y x ^ and y z = for all z £ S, making it indistinguishable to this receiver from y — 0. 
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6 Separation between linear and non-linear rates 



In this section we prove Theorem II. 2 [ To this end we will first show that the linear rate over a 
field of even characteristic is strictly better than the linear rate over a field of odd characteristic for 
the index coding problem associated to the Fano matroid, and that the reverse relation holds for 
the non-Fano matroid. Then we will take the lexicographic product of the two index codes to get 
a gap between the linear and non-linear coding rates, and then use lexicographic products again to 
amplify that gap. 

The Fano matroid, denoted T , and the non-Fano matroid, denoted J\f, are 7 element, rank 3 
matroids. The seven columns of the matrix 



constitute a linear representation of the Fano matroid when char(F) = 2 and one for the non- 
Fano matroid when char(F) ^ 2. We will use U = {100,010,001,110,101,011,111} to index the 
elements of the two matroids. Further let O C IA be the vectors with odd hamming weight, let B 
be the vectors with hamming weight one and let i + j for i,j 6W be the bitwise addition of i,j. 

It is well known that the Fano matroid is representable only in a field of characteristic 2, and 
the non-Fano matroid is representable in any field whose characteristic is different from 2 but not 
in fields of characteristic 2. We use a generalization of this fact to prove the following theorem that 
directly implies Theorem 11.21 

Theorem 6.1 (Separation Theorem). Let G = Gjr • G_\f. There exists some e > such that 
p(G' n ) = 16 n whereas \(G' n ) > (16 + e) n for all n. 

The fact that f3{G ,n ) = 16 n will be a straightforward application of Proposition 15.21 and The- 
orem 15.41 The lower bound on the linear rate however will require considerably more effort. In 
order to bound A from below we will extend the LP 03 to two LPs, one of which will be a lower 
bound for linear codes over fields with odd characteristic and the other for linear codes over even 
characteristic. Each one will supplement the matrix A in the LP with a set of constraints, one 
set derived from dimension inequalities based on the representation of the the Fano matroid and 
the other from the non-Fano matroid. The LP that gives a lower bound for linear codes over a 
field with even characteristic will be used to show that the linear broadcast rate of GV over a field 
of even characteristic is strictly greater than four, and the LP for odd characteristic will imply 
the corresponding result for Gp. Furthermore, the constraints will satisfy the conditions of Theo- 
rem 13.41 Putting this all together implies that when we take the lexicographic product of the Fano 
and non-Fano index coding problems, no linear code is as good as one that combines linear codes 
over F2 and F3. 

Before explaining how we derive these constraints, we introduce a bit of notation. If {Vi}i^i are 
subspaces of a vector space V, let the span of Vi and Vj be denoted V{ + Vj and let dim({Vi}j e /) 
be the dimension of the span of {Vi}^. Also, let d({Vi\i & i) be a 2^ dimensional vector indexed 
by the subsets of I such that the coordinate indexed by S is dim({Vi}i<=s)- We let V\ © • • • © \% 
denote the sum of mutually complementary subspaces V\, . . . ,Vk- If V = V\ © • • • © Vk then V is 
isomorphic to the vector space Yli=i ^ v ^ a * ne mapping (v±, . . . , vp.) i->- v\ H + v%. In this case, 




1 
1 







1 



111 
10 11 
110 1 



15 



for an index set S C {1, . . . , k}, we will use ns to denote the projection function V — > ©igsVi, i.e. 
the function that maps an element v = Ym=i v i t° the element ^(u) = YlieS v i- 

The fact that the Fano matroid can be represented over F2 and the non-Fano matroid cannot 
tells us something about dimension dependencies that can occur in F2. The following lemma is 
extracting the critical dimension relations that distinguish vector spaces over F with char(F) = 2. 

Lemma 6.2. Let V = V\ ffi V2 ffi V3 be a vector space over a field ¥, and suppose W C V is a linear 
sub space that is complementary to each of V\ © V2, V\ © V3, V2 ffi V3. Then 

, . . . . . . . f2dim(W r ) i/char(F) = 2 , % 

dim(vr 12 (W),vr 13 (W),vr 23 (W))= ; (6.1) 

[3dim(W) if char(F) 7^ 2. 

Proof. Recalling that V is isomorphic to Y\a=i we wu ^ write elements of V as ordered triples. 
Our assumption that W is complementary to each of V\ ffi V2, Vi ffi V3, V2 © V3 implies that a nonzero 
element of W has three nonzero coordinates, a fact that we will use in both cases of the lemma. 

If char(F) = 2, then every vector (2, y,z) £V satisfies 

71-12(2, y, z) + 71-13(2, y, z) = (2, y, 0) + (2, 0, z) = (0, y, z) = 7r 23 (2, y, z) 

hence ni2(W) + ni3(W) = 7^3 (W). Consequently 

dim(7r 12 (iy),7r 13 (iy),7r23(^)) = dim (7n 2 (W)> ttisCW)) < 2dim(W). 

To prove the reverse inequality we observe that tti 2 (W) and 7ri3(W) are complementary, since every 
nonzero element of tti 2 (W) is of the form (2, y, 0) with x,y 7^ 0, whereas every nonzero element of 
^13(W) is of the form (2, 0, z) with i,z/0, and hence 7Ti 2 (W / ) n iriz{W) = {0}. 

When char(F) 7^ 2, we prove Equation (16. ip by showing that 7Ti 2 (W), 7ri3(W), 7r 2 3(W) are 
mutually complementary. Consider any three vectors W\ = (a?i,yi,zi), w 2 = (22, 2/2,^2), arid 
^3 = (23,2/3,-23), all belonging to W, such that 

= 7T 2 3 (21,2/1,21) + 7Tl 3 (2 2 ,y2,^2) + 7T12 (23 , 2/3 , ^3 ) = (x 2 + 2 3 ,yi + y 3 , Z 1 + 2 2 ) • 

This implies that 22 + 23 = 0, so the first coordinate of W2 + w$ is zero. However, the zero 
vector is the only vector in W whose first coordinate is zero, hence W2 + W3 = 0. Similarly, 
^1 + ^3 = and w\ + w 2 = 0. Now using the fact that 2 is invertible in F, we deduce that 
wi = |[(n>i + w 2 ) + (wi + W3) — (w 2 + Ws)] = 0, and similarly W2 = and ^3 = 0. Thus, the 
only way to express the zero vector as a sum of vectors in 7Ti2(W), 7ri3(W), iT23(W) is if all three 
summands are zero, i.e. those three subspaces are mutually complementary as claimed. ■ 

6.1 Linear codes over fields of characteristic two 

This section provides the ingredients for proving that A F (G^) > 4 for F with char(F) = 2. 

Lemma 6.3 (Conditional Even Characteristic Inequality). Suppose {VijieW are 7 subspaces of a 
vector space over F such that char(F) = 2 and 

(1) dim({Vi} ie0 ) = dim({V$} iGB ) 

(ii) dim(K, Vj,V k ) = dim(K) + drm(V,) + dim(Vfc) Vi, j, k£(D 
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(iii) dim(V, V,-,V i+i ) = dim(V, V,) Vi,j E O 
Then dim(Viio, Vioi, V u) < 2dim(V m )- 

Proof of Lemma \6.3[ Hypotheses (i) and (iii) of the lemma imply that all 7 subspaces are contained 
in the span of Vxoo, Vbio> Vboi- Moreover, hypothesis (ii) implies that Vloq, Vbio> Voqi are mutually 
complementary and that Vm is complementary to each of Vioo + Voio, Voo + Vooi , Vow + Vooi ■ 
Thus, we can apply Lemma IfPl with V = Voo © Vbio © Vooi and W = Vm, yielding the equation 
dim(7ri2(Viii),7r23(Fiii),-7r 13 (Viii)) = 2dim(V m ). 

We claim that 7ri2(Vm) = (Vooi + Vm) fl (Vioo + Vbio)- To see this, take an arbitrary element 
w 6 Vm having a unique representation of the form x + y + z with x E Vioo,y E Vq\q,z E Vooi- 
By definition ~K\i{w) = x + y = w — z, from which it can be seen at once that ni2(w) belongs to 
both Vioo + Voio and Vooi + Vm. Conversely, any element v E (Vooi + Vm) fl (Vioo + V)io) can be 
expressed as v = w — z where w E Vm, z E Vooi but it can also be expressed as v = x + y where 
x E Vioo, V £ Vbio- Consequently, w = x + y + z and v = ir^iw). 

Hypothesis (iii) implies that Vno is contained in both Vooi + Vm and Vioo + Vbio, hence 
Vno Q 7ri2(V m ). Similarly Vioi C 7ri 3 (Vm) and V n C 7r 23 (Vm). Hence dim(Vno, Voi, V n) < 
dim(7ri2(Vm),7r23(Viii),7ri 3 (Vm)) = 2dim(V m ), as desired. ■ 

In what follows we will transform the conditional inequalities given in the lemma above to a 
general inequality that applies to any 7 subspaces of a vector space over a field of characteristic 2 by 
using the following approach. We will start with arbitrary subspaces and then repeatedly modify 
them until they satisfy the conditions of Lemma 16.31 At that point the result in this conditional 
lemma will imply an inequality involving the dimensions of the modified subspaces, which we will 
express in terms of the dimensions of the original subspaces. 

Theorem 6.4 (Even Characteristic Inequality). There exists a 2' -dimensional vector A even such 
that for any 7 subspaces {V}ieW of a vector space over ¥ with char(F) = 2, 

A even ■ d({Vi} ie u) > and A 

even r(A0 < 0. 

Proof. As mentioned above, the proof will proceed by repeatedly modifying the input subspaces 
until they satisfy the requirements of Lemma [6.31 The modifications we make to a vector space are 
of one type: we delete a vector w from a subspace V that contains w, by letting B be a basis of V 
containing w and then replacing V with the span of B \ w. 

Let {V}ieW be seven subspaces of a vector space V over F such that char(F) = 2. We will 
modify the subspaces {V}j e ^ into {V/}j S ^ that satisfy the conditions of Lemma lOl To start, we 
set {V[}igj = {V}«eW- We then update {V/}j g ^ in three steps, each of which deletes vectors of a 
certain type in an iterative fashion. The order of the deletions within each step is arbitrary. 



Step 1: Vectors in V/u but not in YlieB^I f rom Vm 

Step 2: (a) Vectors in V{ 00 fl V 'i from V ' 10 . 

(b) Vectors in V ' 01 n (V/ 00 + V ' 10 ) from V ' ol 

(c) Vectors in V/ n n (V/ 00 + V ' 10 ) from V{ u 

(d) Vectors in V( u n (V ' 10 + V ' 01 ) from V( n 
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(e) Vectors in V{ n n (V^ + V^ 01 ) from V{ u . 
Step 3: Vectors in V( + j but not in V( + V- for i, j € O from V^' + -. 

First, we argue that {V(}i£u satisfy the conditions of Lemma [6.31 The deletions in step (1) 
ensure that V/-Q is contained in ^2 i& gV-, thus satisfying condition (i). The deletions in steps 
(2a)-(2b) ensure that V{ 00 , Vq 10 , Vq 01 are mutually complementary, and steps (2c)-(2d) ensure that 
V(n is complementary to the sum of any two of them, thus satisfying condition (ii). Furthermore, 
step (2) does not change Yli^B because we only delete a vector from one of -{V/jigg when it 
belongs to the span of the other two. Thus condition (i) is still satisfied at the end of step (2). Step 
(3) ensures that V- + j is contained in V[ + V-, thus satisfying condition (iii). Furthermore, it does 
not modify V[,i € O, and thus conditions (i) and (ii) remain satisfied after step (3). 

Now, by Lemma 16.31 we have that 

dim^oXoiXli) < 2dim(y 1 ' 11 ). (6.2) 

Let 

6 = dim(F m , {Vi} ie B) ~ dim({yi} iee ) 
5[i\j, k] = dim(V, Vj,V k ) - dim(y„ V k ) 

5{i; j] = dim(V n Vj) = dim(^) + dim(^) - dim(V h Vj) 
6[i;j, k] = dim{Vi n (Vj + V k )) = dim(V 4 ) + dim(Vj, V k ) - dim(V h Vj, V k ) 

Observe that after step (1) dim(V 1 / 11 ) = dim(Vni) — 5, and steps (2) and (3) only delete more 
vectors from V{ u , so we have dim(V{ u ) < dim(Vm) — 5. 

It remains to get a lower bound on diva(y{ 10 , V[ i, Vqh) in terms of dimensions of subsets 
of {Vi}^. We do this by giving an upper bound on the total number of vectors deleted from 
E = V{ 10 +V{ 01 +Vq U in terms of the 5 terms we defined above. In steps (1) and (2) we delete nothing 
from E, but we delete some vectors from V[, i £ O. Specifically, <5[100; 010] vectors are deleted from 
^010' <5[001; 100, 010] vectors are deleted from Vq 01 , and no vectors are deleted from Vioo- As already 
noted, step (1) deletes 5 vectors from V{ n , while step (2) deletes at most ^ gB #[111; vectors 
from V[u. To summarize, the dimensions of V-,i € O, after steps (1) and (2), satisfy: 



dim(^ioo) = dim(F 100 ) (6.3) 

dim(y ' 10 ) = dim(F 010 ) - *[100; 010] (6.4) 

dim(y ' 01 ) = dim(Fooi) - <$[001; 100, 010] (6.5) 

dim(n'ii) > dim(Fiu) (6.6) 

i,jeB 



In step (3), when we delete vectors in V- + j but not in V( + V-, if no deletions had taken place 
in prior steps then the number of vectors deleted from would be 5[i + However, the 

deletions that took place in steps (1) and (2) have the effect of reducing the dimension of V- + Vj, 
and we must adjust our upper bound on the number of vectors deleted from V- + j to account for 
the potential difference in dimension between Vi + Vj and V- + V-. When i = 100, j = 010, there 
is no difference between Vi + Vj and V( + Vj, because the only time vectors are deleted from either 
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one of these subspaces is in step (2a), when vectors in V[ 00 n V \ are deleted from V \ without 
changing the dimension of V[ 00 + V Q \ . For all other pairs i, j G O, we use the upper bound 

dim(K + Vj) - dim(l// + Vj) < [dim(V;) - dim(V/)] + [dim(Vj) - dim(V/)] , 

which is valid for any four subspaces Vi, Vj, V(, Vj satisfying V( C Vi, Vj C Vj. Let Adim(V^) denote 
the difference dim(Vi) — dim(V/). Combining these upper bounds, we find that the number of extra 
vectors deleted from E in step (3) because of differences in dimension between V- + V- and Vi + Vj 
is at most 

( Adim^) + Adim(V}) ] - Adim(Vioo) - Adim(Voio) 



= 2 Yl Adim(^) +3 Y Adim(^) 
\ie{ioo,oio} / \ie{ooi,iii} / 

< 25[100; 010] + 3J[001; 100, 010] + 35 + 3 Y £[111; i, j] 



where the last inequality follows by combining equations (|6.3p - (|6.6p . 

We now sum up our upper bounds on the number of vectors deleted from E in step (3), to find 
that 

dim(E) >dim(y lw ,V 101 ,V 01 i)-Y 8[i+j\i,j)-26[100; 010]-3<5[001; 100, 010]-3<5-3 Y <y [ 111 ;*.j] 

ijeo i,jeB 

(6-7) 

Expanding out all the 5 terms, combining with the upper bound dim(y i / 11 ) < dim(Vni) — 5, 
and plugging these into Equation (|6,2p gives us A even • d({V;}j e ^) > for some 2 7 -dimensional 
vector Ae V en, as desired; after applying these steps one obtains Equation (16. 8p below. When 
{Vi}ieu are one-dimensional subspaces constituting a representation of the non-Fano matroid over 
a field of characteristic 7^ 2, it is easy to check that all of the 5 terms appearing in (|6,7p are 
zero. So, the inequality states that dim(Vno, V101, Von) < 2dim(Vm), whereas we know that 
dim(Viio, V101, Von) = 3dim(Vin) for the non-Fano matroid. Consequently A Q dd • ^(J 7 ) < 0. 
For completeness, the inequality A even • d({Vj}j £ ^) > is written explicitly as follows. 

2dim(Vloo) + 2dim(V io) + 3dim(V oi) + lldim(Fni) 
+3dim(Vioo, Vbio) + 2 dim(Vl 00 , V 00 i) + 2dim(V io, Vboi) 

- dim(Vloo, Vln) - dim(Vbio, Vm) - dim(V 00 i, V m ) - 4 dim(Vi o, V ow , V oi) 
-3dim(Vm, Vioo, Vno) - 3dim(V m , Vi o, V 00 i) - 3dim(V m , Vbio, V 00 i) 

+ dim(Viio, Vloo, V010) + dim(Vi i, Vioo, Vboi) + dim(V n, Vbio, Vboi) 
+ dim(Vno, V m , Vboi) + dim(Vioi, Vm, Vbio) + dim(Vbn, Vm, Vioo) 

- dim(Vli , Vioi, Fon) + dim(V m , Vioo, Vbio, Vboi) > . (6.8) 

This concludes the proof of the theorem. ■ 

6.2 Linear codes over fields of odd characteristic 

The following lemma and theorem, which are analogues of Lemma 16.31 and Theorem 16.41 from 
Section 16.11 provide an inequality for fields with odd characteristic. 
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Lemma 6.5 (Conditional Odd Characteristic Inequality). Suppose {V}iew are 7 subspaces of a 
vector space over ¥ such that char(F) ^ 2 and 

(i) dim({Vi} ie o) = dim({Vi} ieB ) 

(ii) dim(Vi, Vj, V k ) = dim(V-) + dim(V}) + dim(Vfe) Vi, j, k G O 

(iii) dim(V, V„ V i+j ) = dim(V, V,) Vi,j G S 

fwj dim(V, V„ V m ) = dim(V, V}) Vi, j : * + j = 111 

T/ien dim(Viio,Vioi,Vbil) > 3dim(V m ). 

Proof. Just as in the proof of Lemma 16,31 we apply the result of Lemma 16.21 but now with 
char(F) ^ 2. Hypotheses (i) and (iii) imply that all 7 subspaces are contained in the span of 
Vioo> ^biO) ^ooi) and hypothesis (ii) implies that those three subspaces are mutually complementary, 
and that Vm is complementary to the sum of any two of them. Thus, Lemma 16.21 implies that 
dim(Viio, Vioi, Vbn) = 3dim(W). Now we aim to show that hypotheses (iii) and (iv) imply that 
Vno contains ^(Vm), and similarly for Vioi,Vbn. This will imply that dim(Vno, Vioi, Vqh) > 
dim(7Ti2(VF), -K2z{W), Kl3(W)) = 3dim(W) as desired. 

It remains for us to justify the claim that Vno contains 7ri2(Vm). Suppose (x,y,z) belongs to 
Vm, where we use (x, y, z) as an alternate notation for x+y+z such that x belongs to Vioo> V belongs 
to Vbio, z belongs to Vboi- We know from hypothesis (iv) that Vm is contained in Vboi + Vno- So 
write x + y + z = a + b where a is in Vboi and b is in Vno- We know from hypothesis (iii) that Vno is 
contained in Vroo+VbiO) so write b = c+d where c is in Vioo and d is in Vbio- Then x+y+z = c+d+a, 
and both sides are a sum of three vectors, the first belonging to Vioo, the second to Vqio, the third 
to Vboi • Since those three vector spaces are mutually complementary, the representation of another 
vector as a sum of vectors from each of them is unique. So x = c, y = d, z = a. This means that 
x + y = c + d = vri2(x, y, z). Recall that c + d is in Vno- As (x, y, z) was an arbitrary element of 
Vi.ii, w e have shown that Vno is contained in 7Ti2(Vqi). ■ 

Theorem 6.6 (Odd Characteristic Inequality). There exists a 2 7 -dimensional vector A Q dd such 
that for any 7 subspaces {V}ieW of a vector space over ¥ with char(F) ^ 2, 

Aodd • d({V} ieU ) > and A odd • r(T) < . 

Proof. Let {V}ieW be seven subspaces of a vector space V over F such that char(F) ^ 2. Just 
as in the proof Theorem 16.41 we will modify the subspaces {V}j S ^ into {V/}j g ^ that satisfy the 
conditions of Lemma 16,5} starting with {V/}j £ ^ = {V}j e ^. We again delete vectors of a certain 
type in an iterative fashion. The order of the deletions within each step is arbitrary. 



Step 1: Vectors in Vm but not in X^eB^i' f rom V{ n 

Step 2: (a) Vectors in V{ 00 n V ' 10 from V ' 10 . 

(b) Vectors in V ' 01 n (V/ 00 + V ' 10 ) from V ' 01 

(c) Vectors in V( u n (V/ 00 + V ' 10 ) from V{ u 

(d) Vectors in V( u n (V ' 10 + V ' 01 ) from V/ n 
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(e) Vectors in V{ n n (V^ + V^ 01 ) from V{ u . 
Step 3: Vectors in V( + j but not in V- + Vj for i, j € B from 
Step 4: Vectors in V{u but not in V( + Vj for i,j : i+j = 111 from V/n- 

The first two steps in this sequence of deletions, along with the first two conditions in Lemma [6. 5 1 
are identical to those in the even characteristic case. Thus, by arguments from the proof Theo- 
rem [63] we have that by the end of step (2) conditions (i), (ii) are satisfied. Step (3) is almost 
identical to the same step in the even characteristic case; the difference is that now we only perform 
the step for pairs i,j £ B rather than all pairs i,j € O. As before, at the end of step (3) condition 
(iii) is satisfied, and since the step does not modify V- for any i G O, it does not cause either of 
conditions (i), (ii) to become violated. Step (4) ensures condition (iv), so it remains to show that 
step (4) preserves conditions (i)-(iii). Step (4) only modifies V{ n so it doesn't change YlieB^i^ 
therefore preserving (i). It preserves (ii) because if three subspaces are mutually complementary, 
they remain mutually complementary after deleting a vector from one of them. It preserves (iii) 
because (iii) does not involve V(n, which is the only subspace that changes during step (4). 

Now, by Lemma 16.31 we have that 

3dim(^ u ) < dim^Ao^i'oiXii)- (6-9) 

As in the proof of Theorem 16.41 let 

S = dim(Vin, {Vi}i eB ) ~ dim({yj} ieS ) 
5[i\j, k] = dim(V h Vj,V k ) - dim(y„ V k ) 

8[i; j] = dim(Vi n Vj) = dim(^) + dim(^) - dim(V h Vj) 
5[i;j, k] = dim(Vi n (Vj + V k )) = dim(V-) + dim(^, V k ) - dim(^, Vj, V k ) 

Observe that we only reduce the size of subspaces, so dim(l/ 1 ' 10 , V{ 01 , Vq U ) < dim(Viio, V±oi, Von). 

It remains to get a lower bound on dim(y i ' 11 ) in terms of dimensions of subsets of {Vi\i^u- We 
do this by giving an upper bound on the number of vectors we delete from V(n in terms of the 5 
terms we defined above. Step (1) deletes 5 vectors. Steps (2a) and (2b) delete nothing from V{ u , 
and at the end of (2a)— (2b) we have 

dim(V?oo) = dim(y 100 ) (6.10) 
dim(V ' 10 ) = dim(Voio) - «5[100; 010] (6.11) 
dim(V r ' 01 ) = dim(Vooi) - £[001; 100,010] (6.12) 

Steps (2c)-(2e) delete at most ^2ij GB S[lll;i,j] vectors from V{ lt , and they do not change any of 
the other subspaces. 

In step (3) no vectors are deleted from V[n, but we will still need an upper bound on the 
number of vectors deleted in this step since it will influence our upper bound on the number 
of vectors deleted from V{ n in step (4). If no deletions took place prior to step (3), then for 
all i,j £ B exactly 5[i + vectors would be deleted from V- + j during step (3). However, if 

dim(V- ,Vj) < dim(Vi, Vj), then we must adjust our estimate of the number of deleted vectors to 
account for this difference. Steps (1) and (2a) cannot change dim(V^, Vj) for any i,j £ B, but step 
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(2b) reduces each of dim(VQ 01 , V{ 00 ) and dim(Vg 01 , Vq 10 ) by at most £[001; 100,010]. Therefore, at 



the end of step (3) we have 

dim(V( 10 ) = dim(Viio) - <5[110|100, 010] (6.13) 

dim(Vi' 01 ) > dim(Vioi) - <5[101|100, 001] - £[001; 100,010] (6.14) 

dim(y / 11 ) > dim(Fon) - <5[011|010, 001] - £[001; 100, 010] (6.15) 



If no deletions took place prior to step (4) , then the number of vectors we would need to delete 
from V{ n , to make it a subspace of V- + Vj, would be at most £[lll|i, j]. As before, we need to 
adjust this bound to account for the potential difference in dimension between V% + Vj and V( + Vj. 
Using the upper bound 

dim(K + Vj) - dim(V~/ + Vj) < [dim(V) - dim(V~/)] + [dim(V,) - dim(V-)] , 

which is valid for any four subspaces Vi, Vj, V(, Vj satisfying V( C Vi, Vj C Vj, we find that the 
number of extra vectors deleted from V(u in step (4) because of differences in dimension between 
V( + Vj and Vi + Vj (for some i,j&U,i+j = 111), is at most 

dim(V t ) -dim(V7) < £[100; 010] + 3<J[001; 100,010] + ^ S[i + j\i,j], 
iew\{m} ijeB 

where the first inequality follows by combining equations (|6. 10j) — ()6. 15p . 

We now sum up our upper bounds on the number of vectors deleted from Vj u in steps (l)-(4) 
combined, to find that 

dim(V{ u ) > dim(y m )-5- ^ ^[111;*, j}-S[100; 010] -3<5[001; 100,010]- ^ 6[i+j\i,j}. (6.16) 

i,jeB i,jeB 

Expanding out all of the 5 terms, combining with the upper bound on dim(y i ' 11 ), and plugging 
these into Equation (|6.9p gives us A Q dd • d-({Vi}i£u) > for some 2 7 -dimensional vector A Q dd, 
as desired; after applying these steps one obtains Equation (]6.17p below. When {V}ieu are one- 
dimensional subspaces constituting a representation of the Fano matroid over a field of characteristic 
2, it is easy to check that all of the 5 terms appearing in (|6.16p are zero. So, the inequality states 
that dim(Vno, Vioi, Vbn) > 3dim(Vm), whereas we know that dim(Vno, Vioi, Vbn) = 2dim(Vni) 
for the Fano matroid. Consequently A^ ■ r(J r ) < 0. 

For completeness, the inequality A Q dd ■ d({V^}j e ^) > is written explicitly as follows. 

3dim(Uioo) + 3dim(y io) + 9dim(Vboi) +6dim(F m ) + 6 dim(Vi o, V ow ) - 12 dim(Vi o, V ow , Vboi) 
+3dim(Fii , Vioo, Vqiq) + 3dim(Vioi, Vioo, Vboi) + 3dim(Vbn, Vqiq, V oi) 
-3dim(Viu, Vim, Vbio) - 3dim(Viu, Vioo, Vboi) - 3dim(V m , Vbio, Vboi) 

+3dim(Fiii,Vioo,Vbio ! Vboi) + dim(Vii 0j Vioi,Vbii) >0. (6.17) 
This completes the proof of the theorem. ■ 
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6.3 Polynomial separation between the linear and non-linear rates 



The following pair of lemmas shows how to take a single linear constraint, such as one of those 
whose existence is asserted by Theorems 16.41 and 16.61 an d transform it into a tight homomorphic 
constraint schema. To state the lemmas, we must first define the set of vectors Df(K) C M?( K ', for 
any index set K and field F, to be the set of all vectors d({Vk}keK), where {Vk\keK runs through 
all K-indexed tuples of finite-dimensional vector spaces over IF. 

Lemma 6.7 (Tightening Modification). Suppose I is any index set, e is an element not in I, and 
J = I U {e}. There exists an explicit linear transformation from to , represented by a 

matrix B, such that: 

(i) B ■ D ¥ (J) C D ¥ (I) for every field F. 
(ii) Bl = Blj = for all j G J. 

(Hi) If M is a matroid with ground set I and the intersection of all matroid bases of M is the 
empty set, then Br(M + e) = t(M), where M + e denotes the matroid obtained by adjoining 
a rank-zero element to M . 

Proof. If U is any vector space with a J-tuple of subspaces {Uj}j<=j, then there is a quotient map tt 
from U to V = U /U e , and we can form an /-tuple of subspaces {V^jig/ by specifying that V, = ir(Ui) 
for all i E I. The dimension vectors u = d({Uj}) and v = d({Vi}) are related by an explicit linear 
transformation. In fact, for any subset S C 7, if we let Us, Vs denote the subspaces of U, V spanned 
by {Ui}i<zs and {Vi}i e s, respectively, then tt maps Us + U e onto Vs with kernel U e , and this justifies 
the formula 

V S = Usuje} - U{ e }. 

Thus, v = BqU, where Bq is the matrix 

(Bo) ST 



( 1 if T = S U {e} 
-1 ifT = {e} (6.18) 
otherwise, 



and therefore Bq - D$(J) C D$(I). 

Similarly, if U is any vector space with an /-tuple of subspaces {Ui}i<zj and k is any element 
of /, we can define C U to be the linear subspace spanned by {C/j}^, and we can let 
tt : U — > U-k be any linear transformation whose restriction to is the identity. The restriction 
of 7r to Uk has kernel Wk of dimension diu^W/c) = dim({[/j}j e /) — dim({C/i}i e j ) i^). As before, 
let Vi = 7r(L/j) for all i 6 J, let Us,Vs denote the subspaces of U, V spanned by {Ui}i<=s and 
{V} ieS , and let u = d({[/,}),v = d({T^}). If k S then V s = U s and v s = u s , while if 
k € S then Us contains Wk, the linear transformation tt maps Us onto Vs with kernel W^, and 
vs = us — dim(Wfc) = - u/ + unai. Thus, v = -B^u, where B^ is the matrix 



(Bk) 



ST 



1 if T = S 

1 if k G S and T = / \ {k} 

XX 1 (6.19 

-1 if k G S and T = I 
otherwise. 



and therefore B^ ■ D ¥ (I) C D ¥ (I). 



23 



Now assume without loss of generality that I = {1, 2, . . . , re} and let B = B n B n _\ ■ ■ ■ B\Bq. 
We have seen that B ■ D^(J) C D^(I). From (]6. 18|) one can see that BqI = Bol e = and that 
for every k G /, -Bolfc = Ifc- (Here, it is important to note that on the left side refers to a 
vector in and on the right side it refers to a vector in M.^^.) Furthermore, from (|6.19p one 

can see that B^lk = and that B^li = \% for all i ^ k. Thus, when we left-multiply a vector 
w G {1} U {lj}j g j by the matrix B, one of the following things happens. If w is equal to 1 or l e 
then Bqw = hence Bw = 0. Otherwise, w = 1^ G M^"^ for some k £ I, Bqw = 1/% G IR 7 '^, and 
as we proceed to left-multiply 1^ by B\, B2, ■ ■ ■ , it is fixed by Bi {i < k) and annihilated by B^, so 
once again Bw = 0. This confirms assertion (|n|) of the lemma. 

Finally, if M, M + e are matroids satisfying the hypotheses of assertion (|mj) , then for ev- 
ery set S C / we have r(S U {e}) — r({e}) = r(S') and hence Bqt{M + e) = r(M). For any 
k G / our assumption on M implies that it has a matroid basis disjoint from {k}, and hence that 
r(I \ {k}) = r{I). Inspecting (I6.19p . we see that this implies Bkr(M) = r(M) for all k £ I, and 
hence Br(M + e) = r{M) as desired. ■ 

Lemma 6.8 (Homomorphic Schema Extension). Let I be an index set, and let a E be a 

vector such that a T d > for all d G D^(I). Then there is a homomorphic constraint schema 
(Q, A) such that a T is a row of the matrix A(I), and for every index set K and vector d G D$(K), 
A(K)d > 0. If a T l = a T li = for all i G /, then the constraint schema (Q, A) is tight. 

Proof. For any index set J, let Q{J) be the set of all Boolean lattice homomorphisms from V(I) 
to V{J). If h : V(J) —> V{K) is another Boolean lattice homomorphism, the mapping h* is defined 
by function composition, i.e. h*(q) = ho q. 

To define the constraint matrix A( J) associated to an index set J, we do the following. A row 
of A( J) is indexed by a Boolean lattice homomorphism q : V{I) — > V(J) and we define the entries 
of that row by 

q(T)=S 

This defines a homomorphic constraint schema, because if h : V{ J) — > V{K) is any Boolean lattice 
homomorphism and R = A(K) 1 Qh, R' = PhA(J) T , then recalling the definitions of P^, Qh we find 
that 

R Sq = (A(K) T ) S:h4q) = A(K) hoq>s = Yl a T 

Tep{i) 

h(q(T))=S 

R' Sq = Yl ( A (j) T ) S ', q = £ E <*? 

S':h(S')=S S':h(S')=S TeV(I) 

q(T)=S> 

and the right-hand sides of the two lines are clearly equal. 

To prove that A(K)d > for every d G Df(K), we reason as follows. It suffices to take a single 
row of the constraint matrix, indexed by homomorphism q : I K, and to prove that 

£ A(K) qS d s >0. 

seV(K) 
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Using the definition of the constraint matrix entries, this can be rewritten as 

Y a T d q(T) > 0. (6.21) 
TeV(l) 

Let {Vk}keK be a if -tuple of vector spaces such that d = d({Vk}keK)- Define an /-tuple of vector 
spaces {Ui}i£i by setting {7, to be the span of {Vk}k&q{{i})- By our hypothesis on a, 

Y a T dim({Ui} ieT ) >0. 
TeV(i) 

The left side is equal to the left side of (|6.2ip . 

Finally, suppose that a T l = dflj for all i E I. For any index set J, we prove that A(J)1 = 
by calculating the component of A(J)1 indexed by an arbitrary Boolean lattice homomorphism 
q:V(I)^V(J). 

Y MJ)qs = Y Y aT = Y ar = 5X1 = 

sev(j) SeV(J)Ter(i) rev(i) 

q(T)=S 

The proof that A(J)lj = for all j G J similarly calculates the component of A(J)lj indexed by 
an arbitrary q. 

Y A (j)gs= Y Y aT= Y aT 

seV(J) seV(J)TeV(i) TeV(l) 

jeS jeS q(T)=S jeq(T) 

At this point the argument splits into three cases. If j E g(0) then the right side is a T l, which 
equals 0. If j q( J) then the right side is an empty sum and clearly equals 0. If j g(0) but 
j G q(J), then there is a unique i £ I such that j € q({i}). Indeed, if j belongs to q({i}) and 
q({i'}), then j belongs to q({i}) H q({i'}) = q({i} H {«'}), implying that {i} fl {i'} is non-empty and 
that i = i' . The right side of the equation above is thus equal to <3 T li, which equals 0. ■ 

Finally, before proving Theorem 16. H it will be useful to describe the following simple operation 
for combining constraint schemas. 

Definition 6.9. The disjoint union of two constraint schemas (Qi,A\) and (Q2,A2) is the con- 
straint schema which associates to every index set / the disjoint union Q(I) = Qi(I) U 0.2(1) and 
the constraint matrix A(I) given by 



A(I) qS 



Ai(J) ffS ifgeQi(I) 
A 2 (I) qS i£qeQ 2 (I). 



For a homomorphism h : V(I) V{J), the function h* : Qi(I) U Q 2 {I) -> Q\{J) □ Qz{J) is defined 
by combining Qi(J) — ^ Qi(J) an d Q2CO Q 2 {J) in the obvious way. 

Lemma 6.10. The disjoint union of two tight constraint schemas is tight, and the disjoint union 
of two homomorphic constraint schemas is homomorphic. 
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Proof. For all index sets I and vectors v £ R^^ , the constraint matrix of the disjoint union satisfies 

A(lW-( Ml) )v-( Ml)v 
A()V "UWJ V ~U(i)v 

so if both constraint schemas are tight then so is their disjoint union. If h : V(I) — > V(J) is a 
Boolean lattice homomorphism then let Qih, Q2h, Qh denote the matrices representing the induced 
linear transformations R Ql( -^ -> R Ql ( J \ R Q2 ^ -> M 22 ( J ), and R Q W -> M Q ( J ), respectively. If 
both constraint schemas are homomorphic, then 

A(J) T Q h = (Ax(J) T A 2 (J) T )( Q ^ h Q 2f ) =(MJ) T Qih A 2 (J) T Q 2h ) 

= {P h A 1 (J) T P h A 2 (J) T ) = P h A(J) T , 

which confirms that the disjoint union is homomorphic. ■ 

The proof of Theorem 16.11 now follows by combining earlier results. 

Proof of Theorem 16.11 (Separation Theorem). The fact that (3(Gjr) = f3{Gj^) = 4 is an im- 
mediate consequence of Theorem [531 The submultiplicativity of f3 under the lexicographic product 
(Theorem [2J| then implies that G = GfG A r satisfies (3{G' n ) < (4 • 4) n = 16™. A lower bound of 
the form (3{G' n ) > 16 n is a consequence of Proposition HT21 which implied that b(Gp) = b(Gj\f) = 4, 
from which it follows by the supermultiplicativity of b under lexicographic products (Theorem 13. 4p 
that 16 n < b(G' n ) < f3(G' n ). Combining these upper and lower bounds, we find that j3{G* n ) = 16 n . 

It is worth noting, incidentally, that although each of Gjr, Gj\f individually has a linear solution 
over the appropriate field, the index code for G = Gjr • GV implied by the proof of Theorem 12.21 
- which concatenates these two linear codes together by composing them with an arbitrary one- 
to-one mapping from a mod-2 vector space to a mod-p vector space (p odd) — is highly nonlinear, 
and not merely a side-by-side application of two linear codes. 

To establish the lower bound on A F (G" n ), we distinguish two cases, char(F) = 2 and char(F) ^ 2, 
and in both cases we prove X ¥ (G' n ) > (16 + e) n using the LP in Figured] with a tight homomorphic 
constraint schema supplying the constraint matrix A. We use different constraint schemas in the 
two cases but the constructions are nearly identical. Let M denote the matroid J\f if char(F) = 2, 
and let M = J- if char(F) ^ 2. In both cases, we will let M + e denote the matroid obtained 
by adjoining a rank-zero element to M , and we will denote the ground sets of M, M + ebyI,J, 
respectively. Recall the vectors A even ,A 0( jd £ R^^ from Theorems 16.41 and 16.61 Let A = A even if 
char(F) = 2, A = A odd if char(F) / 2. By Theorems E3 and ESJ A • r(M) < 0, a fact that we will 
be using later. 

Recall the linear transformation B : RT^ — > IR^) from Lemma 16.71 and let 

a = B T A. 

For any d € D^(J) we have a T d = A T Bd > 0, since Bd £ D$ (I) and A • v > for all v € D-g (I) . 
The equations Bl = Blj = for all j G J imply that q t 1 = a T lj = 0. Applying Lemma 16.81 
to obtain a tight homomorphic constraint schema from a, and taking its disjoint union with the 
submodularity constraint schema, we arrive at a tight homomorphic constraint schema (Q, A) such 
that every vector d € Dp(K), for every index set K, satisfies the system of inequalities A{K)d > 0. 
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Consider the LP in Figure Q3 instantiated with constraint schema (Q,A). We claim that its 
optimal solution b ¥ (G), for any index coding problem 67, satisfies b ¥ (G) < X ¥ (G). To prove this 
we proceed as in the proof of Theorem I3.lt consider any linear index code over F with message 
alphabet E, sample each message independently and uniformly at random, and consider the input 
messages and the broadcast message as random variables, with H(S) denoting the joint entropy of 
a subset S of these random variables. The set of random variables is indexed by K = V(G) U {e}, 
where V(G) denotes the set of messages in G and e is an extra element of K corresponding 
to the broadcast message. Letting M k , for k € K, denote the matrix representing the linear 
transformation defined by the k th random variable, and letting U k denote the row space of Mp,, we 
have H(S) = log |F| • dim({U k } keS ) for every S C K. For S C V{G) let 

H(SU{e}) / jggjFj N .. fn mx , 

zs = log isi = [^m) dim(c/e ' {Uihes) - 

We aim to show that z satisfies the constraints of the LP, implying that b ¥ (G) < z$ = -^jigr and 
consequently (since the linear index code over F was arbitrary) that b ¥ (G) < X ¥ (G). The proof 
of Theorem 13.11 already established that Zy(G) = | V(G) | and that zt — zs < cst for all S C T, 
so we need only show that Az > 0. As in the proof of Lemma 16.71 we let ir denote the quotient 
map from U = £ y(Gf) to U/U e , we define Vi = 7r(J7j), and we observe that for all S C V(G), 
dim({Vi}i<=s) = dim(C/ e , {Ui}i£s) ~ dim(C/ e ). This implies that 



log |F| 
log |E| 



d. 



where d = d({T^}). Our construction of A implies that Ad > and that Al = 0, hence Az > 0. 

It remains to show that b ¥ (G) > 16, from which the claimed lower bound follows by supermul- 
tiplicativity. Since our constraint schema includes submodularity, we have b ¥ (Gjr) > b(Gjr) = 4 
and b F (Gj^) > b(G_\f) = 4, so we need only show that one of these inequalities is strict, and we 
accomplish this using Theorem 15.51 Specifically, we show that the matrix A = A(I) has a row 
indexed by some q G Q(I), such that (Ar(M)) q < 0. Recall that Q(I) is the set of Boolean lattice 
homomorphisms from V(J) to V(I), where J = I U {e}. Let q be the homomorphism that maps 
{e} to and {i} to itself for every is/. To prove that (Ar(M)) q < 0, we let r(-) and r(-) denote 
the rank functions of M, M + e, respectively, and we recall the definition of the matrix entries a q s 
from (]6.20p . to justify the following calculation: 

(Ar(M)) q = a iSr(S)= £ a T r(q(T))= £ a T f(T) 

sev{i) Tev(j) Tev(j) 

= a T r(M + e) = A 7 Br(M + e) = A T r(M) < 

where the last two steps used Lemma I6.7tlml) and Theorem 16.41 or 16.61 for a field F of even or odd 
characteristic, respectively. ■ 
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